
fix: column aware row encoding: improve the implementation and add bench #17818

Merged (9 commits) on Jul 30, 2024

Conversation

@fuyufjh (Member) commented Jul 26, 2024

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

Resolves #16763

Unfortunately, we don't currently guarantee that the persisted column IDs are sorted. For example, create table t (a varchar, b int, c int) creates a state table with column IDs 1,2,3,0, where 0 always refers to _row_id.

As a result, I reverted the order-based implementation and switched to a hash table.
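For illustration, the hash-based approach can be sketched as follows. This is a simplified, hypothetical sketch with illustrative names (Deserializer, required_column_ids, place), not the actual RisingWave code, and it uses std's HashMap where the PR uses ahash:

```rust
use std::collections::HashMap;

// Sketch: the deserializer maps each persisted column ID to the position
// where the decoded datum should land in the output row. Because persisted
// IDs are not guaranteed to be sorted (e.g. 1,2,3,0 with 0 = _row_id),
// a hash-map lookup replaces the previous order-based merge.
struct Deserializer {
    required_column_ids: HashMap<i32, usize>,
}

impl Deserializer {
    fn new(column_ids: &[i32]) -> Self {
        Self {
            required_column_ids: column_ids
                .iter()
                .enumerate()
                .map(|(idx, &id)| (id, idx))
                .collect(),
        }
    }

    /// For each (column_id, datum) pair found in the encoded row, place the
    /// datum at the requested position; unknown column IDs are skipped.
    fn place<T: Clone>(&self, encoded: &[(i32, T)], out: &mut [Option<T>]) {
        for (id, datum) in encoded {
            if let Some(&idx) = self.required_column_ids.get(id) {
                out[idx] = Some(datum.clone());
            }
        }
    }
}
```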

Benchmark result (on my MacBook)

Benchmarking bench_column_aware_row_encoding_decode: Collecting 100 samples in estimated 5.0018 s (1
bench_column_aware_row_encoding_decode
                        time:   [445.71 ns 448.84 ns 452.01 ns]
                        change: [-49.821% -49.204% -48.648%] (p = 0.00 < 0.05)
                        Performance has improved.

Why ahash::HashMap?

According to my benchmark, ahash::HashMap is the fastest of the three implementations tested:

ahash::HashMap              445.47 ns
hashbrown::HashMap          509.70 ns
std::collections::HashMap   595.98 ns
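The PR's numbers come from a criterion bench using ahash (an external crate, not shown here). As a std-only sketch of the comparison's shape, one can build identical id-to-index maps and time repeated point lookups; the timing harness below is illustrative, not the actual benchmark:

```rust
use std::collections::{BTreeMap, HashMap};
use std::time::{Duration, Instant};

/// Build the same id -> index mapping in both map types so their lookup
/// behavior can be compared on identical data.
fn build_maps(ids: &[i32]) -> (HashMap<i32, usize>, BTreeMap<i32, usize>) {
    let hash: HashMap<i32, usize> = ids.iter().enumerate().map(|(i, &id)| (id, i)).collect();
    let btree: BTreeMap<i32, usize> = ids.iter().enumerate().map(|(i, &id)| (id, i)).collect();
    (hash, btree)
}

/// Time repeated point lookups; black_box keeps the compiler from
/// optimizing the loop away.
fn time_lookups<F: Fn(i32) -> Option<usize>>(get: F, ids: &[i32], iters: u32) -> Duration {
    let start = Instant::now();
    for _ in 0..iters {
        for &id in ids {
            std::hint::black_box(get(id));
        }
    }
    start.elapsed()
}
```

A real comparison should use criterion (as the PR does) for statistically sound sampling rather than a single Instant measurement.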

Checklist

  • I have written necessary rustdoc comments
  • I have added necessary unit tests and integration tests
  • I have added test labels as necessary. See details.
  • I have added fuzzing tests or opened an issue to track them. (Optional, recommended for new SQL features Sqlsmith: Sql feature generation #7934).
  • My PR contains breaking changes. (If it deprecates some features, please create a tracking issue to remove them in the future).
  • All checks passed in ./risedev check (or alias, ./risedev c)
  • My PR changes performance-critical code. (Please run macro/micro-benchmarks and show the results.)
  • My PR contains critical fixes that are necessary to be merged into the latest release. (Please check out the details)

Documentation

  • My PR needs documentation updates. (Please use the Release note section below to summarize the impact on users)

Release note

If this PR includes changes that directly affect users or other significant modifications relevant to the community, kindly draft a release note to provide a concise summary of these changes. Please prioritize highlighting the impact these changes will have on users.

@fuyufjh fuyufjh requested a review from BugenZhao July 26, 2024 08:32
@fuyufjh fuyufjh marked this pull request as ready for review July 26, 2024 08:32
@fuyufjh (Member, Author) commented Jul 26, 2024

The sorted implementation is still 10% faster than this hash-table implementation.

448.84 ns -> 403.56 ns (89.91%)

Considering the refactoring effort and the additional compatibility logic required, perhaps we can postpone it until necessary.
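For context, the faster order-based alternative mentioned above amounts to a two-pointer merge over two sorted ID lists instead of per-column hash lookups. A hypothetical sketch (illustrative name merge_decode; assumes both ID lists are sorted ascending, which the persisted data does not currently guarantee):

```rust
// Sketch of the abandoned order-based decode: a single merge pass over
// the encoded row's (sorted) column IDs and the (sorted) requested IDs.
fn merge_decode<T: Clone>(
    encoded: &[(i32, T)], // pairs sorted by column ID
    required: &[i32],     // sorted ascending
) -> Vec<Option<T>> {
    let mut out = vec![None; required.len()];
    let mut i = 0; // cursor into `encoded`, never moves backwards
    for (pos, &id) in required.iter().enumerate() {
        // Skip encoded columns with smaller IDs; they were not requested.
        while i < encoded.len() && encoded[i].0 < id {
            i += 1;
        }
        if i < encoded.len() && encoded[i].0 == id {
            out[pos] = Some(encoded[i].1.clone());
        }
        // If the ID is absent from the encoded row, leave None.
    }
    out
}
```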

@graphite-app graphite-app bot requested a review from a team July 26, 2024 08:52
@@ -176,7 +168,7 @@ impl ValueRowSerializer for Serializer {
 /// Should non-null default values be specified, a new field could be added to Deserializer
 #[derive(Clone)]
 pub struct Deserializer {
-    needed_column_ids: BTreeMap<i32, usize>,
+    required_column_ids: HashMap<i32, usize>,
@BugenZhao (Member) commented Jul 26, 2024
Interesting. I thought BTreeMap was faster when there are only a few entries: https://arc.net/l/quote/okdycbqi

The current value of B is 6. Given that, can we also benchmark the case of a table containing fewer than 6 columns?

@fuyufjh (Member, Author) replied:
Added bench column_aware_row_encoding_4_columns. Observed a similar performance improvement.

Benchmarking column_aware_row_encoding_16_columns_encode: Collecting 100 samples in estimated 5.0015
column_aware_row_encoding_16_columns_encode
                        time:   [491.00 ns 492.30 ns 493.95 ns]
                        change: [-3.6361% -2.8341% -2.0340%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 10 outliers among 100 measurements (10.00%)
  6 (6.00%) high mild
  4 (4.00%) high severe

Benchmarking column_aware_row_encoding_16_columns_decode: Collecting 100 samples in estimated 5.0011
column_aware_row_encoding_16_columns_decode
                        time:   [445.62 ns 448.76 ns 451.61 ns]
                        change: [-51.073% -50.248% -49.515%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
  3 (3.00%) low mild
  2 (2.00%) high mild
  2 (2.00%) high severe

Benchmarking column_aware_row_encoding_4_columns_encode: Collecting 100 samples in estimated 5.0009 s (23M iter
column_aware_row_encoding_4_columns_encode
                        time:   [221.45 ns 224.51 ns 228.98 ns]
                        change: [+0.1302% +0.7332% +1.4856%] (p = 0.03 < 0.05)
                        Change within noise threshold.
Found 7 outliers among 100 measurements (7.00%)
  2 (2.00%) high mild
  5 (5.00%) high severe

Benchmarking column_aware_row_encoding_4_columns_decode: Collecting 100 samples in estimated 5.0004 s (38M iter
column_aware_row_encoding_4_columns_decode
                        time:   [131.47 ns 131.82 ns 132.26 ns]
                        change: [-46.562% -46.074% -45.656%] (p = 0.00 < 0.05)
                        Performance has improved.
Found 11 outliers among 100 measurements (11.00%)
  1 (1.00%) low mild
  5 (5.00%) high mild
  5 (5.00%) high severe

@fuyufjh fuyufjh changed the title fix: column aware row encoding: fix the implementation and add bench fix: column aware row encoding: improve the implementation and add bench Jul 30, 2024
@fuyufjh fuyufjh enabled auto-merge July 30, 2024 08:21
@fuyufjh fuyufjh added this pull request to the merge queue Jul 30, 2024
Merged via the queue into main with commit 8984ae2 Jul 30, 2024
29 of 31 checks passed
@fuyufjh fuyufjh deleted the eric/bench_column_aware_encoding branch July 30, 2024 16:15
Successfully merging this pull request may close these issues:

  • perf: improve the performance of column_aware_row_encoding::Deserializer

3 participants